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We consider wavefunctions which are non-negative in some tensor product basis. We study what 
possible teleportation can occur in such wavefunctions, giving a complete answer in some cases 
(when one system is a qubit) and partial answers elsewhere. We use this to show that a one¬ 
dimensional wavefunction which is non-negative and has zero correlation length can be written in 
a “coherent Gibbs state” form, as explained later. We conjecture that such holds in higher dimen¬ 
sions. Additionally, some results are provided on possible teleportation in general wavefunctions, 
explaining how Schmidt coefficients before measurement limit the possible Schmidt coefficients after 
measurement, and on the absence of a “generalized area law’® even for Hamiltonians with no sign 
problem. 

One of the motivations for this work is an attempt to prove a conjecture about ground state 
wavefunctions which have an “intrinsic” sign problem that cannot be removed by any quantum 
circuit. We show a weaker version of this, showing that the sign problem is intrinsic for commuting 
Hamiltonians in the same phase as the double semion model under the technical assumption that 
TQO-2 hokkP. 


Positive and negative signs play an incredibly important role in quantum mechanics. For example, the basic 
phenomenon of interference relies on a cancellation between amplitudes of opposite sign. When we restrict to certain 
signs being positive, in many ways the problem seems to simplify. As an example, quantum Monte Carlo simulations 
based on a stochastic sampling of the path integral become a powerful tool in the study of interacting many-body 
systems when the Hamiltonian has all off-diagonal elements negative (such a property has long been referred to as 
“having no sign problem” in physics and more recently has been called “stoquastic” in quantum information theory). 
When we restrict to Hamiltonians with no sign-problem, it is believed that the computational difficulty of estimating 
the ground state energy becomes smaller. While this problem is QMA-complete for arbitrary local Hamiltonians 
(see Ref. U for the exact statements of what Hamiltonians may be considered), it is StoqMA-complete 2 ^ for such 
Hamiltonians with a restriction to no negative signs, and it is believe that StoqMA is strictly smaller than QMA, 
although there is no proof. 

This property of having no sign problem is, of course, a basis dependent statement. If one takes a generic Hamil¬ 
tonian with no sign problem and conjugates this Hamiltonian by a generic short-depth local quantum circuit, the 
result generically will have a sign problem. Conversely, given a Hamiltonian with a sign problem, it is quite possible 
that by some local quantum circuit, one might be able to bring the Hamiltonian to a form with no sign problem, 
simplifying the simulation. However, we may expect that for many Hamiltonians, there is no local transformation that 
will remove the sign problem. Complexity-theoretic evidence in favor of this statement is that we expect that StoqMA 
and QMA are distinct. In more detail, if it were possible for all local Hamiltonians (indeed, all local frustration free 
Hamiltonians) to find a local transformation that mapped the Hamiltonian to one with no sign problem, then we 
would have QMA=StoqMA, because we could then reduce any problem in QMA to a problem in StoqMA where 
the witness includes a specification of the local quantum circuit, and the verifier checks that the quantum circuit 
indeed maps the Hamiltonian to one with no sign problem, and then does the verification of the witness for this new 
Hamiltonian with no sign problem. If the circuit has constant depth, then the new Hamiltonian still acts on a constant 
number of qubits (one can also allow sufficiently slowly growing depth). However, it would be desirable to have a 
better understanding and to prove whether such an obstruction is present or not; further, it would be interesting to 
see if such an obstruction could happen for a gapped Hamiltonian. 

This last question is a beautiful question that as far as we know was first raised by Matthew Fisher 5 : does there 
exist a gapped local spin Hamiltonian whose ground state cannot be mapped by a local quantum circuit to a state 
that is non-negative in the computational basis? (i.e., that it is a sum of basis states in the computational basis that 
have positive coefficients) If so, let us say that “the ground state of that Hamiltonian has an intrinsic sign problem”. 
Or, more generally, if there are multiple degenerate ground states and then a gap to the rest of the spectrum, is it 
possible that there is no quantum circuit that maps the ground state subspace to a space which has an orthonormal 
basis of non-negative wavefunctions? In this case, let us say that “the ground state subspace of that Hamiltonian 
has an intrinsic sign problem”. Michael Freedman has conjectured® 1 that the sign problem indeed is intrinsic for the 
ground state (or ground state subspace) of such simple Hamiltonians as the Levin-Wen Hamiltonian Hlw for the 
double semion model (see Ref. 0] for this Hamiltonian). 

Here we should remark that there also are two weaker questions one might ask for a Hamiltonian H. The weakest 
result would be that there is no local quantum circuit which brings the Hamiltonian H to a form with no sign problem; 
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in this case we will say that “the Hamiltonian H has an intrinsic sign problem”. A slightly stronger result would be 
that for all gapped local Hamiltonians H' which have the same ground state subspace as H, the Hamiltonian H' has 
an intrinsic sign problem; in this case, we will say that “all Hamiltonians in the same phase as H have an intrinsic 
sign problem” (here “phase” refers to a particular quantum phase of matter). Note that indeed if the ground state 
subspace of a Hamiltonian has an intrinsic sign problem, then all Hamiltonians in that phase have an intrinsic sign 
problem, and if all Hamiltonians in the same phase as H have an intrinsic sign problem then H itself has an intrinsic 
sign problem, so these indeed would be weaker results. We show a slightly weaker form of this result in the next 
section. The results in that section, [l] are logically independent from later results in the paper, and the reader may 
choose to read only one or the other. 

Then, partly motivated by the attempt to show the strongest result about the wavefunction itself, we study questions 
of states in which all amplitudes are non-negative in the computational basis from the perspective of quantum 
information theory. This leads to some unusual questions. We begin with a question of teleportation: given a 
tripartite system A,B,C with A and C completely decorrelated, what possible correlations can appear between A 
and C after measurement on B2 (This can alternatively be regarded as a question of entanglement swapping.) 
Without the sign problem restriction, we can in fact create perfect entanglement between A and C. However, with 
the restriction that the wavefunction for this tripartite system has no sign problem, we find that the entanglement 
is limited. We give a detailed analysis in the case that A is a qubit, and find that as the dimension of C grows, 
the possible entanglement between A and C after measurement increases but for any finite dimension of C it is not 
possible to have maximal entanglement. In the qudit case, we give a less quantitative argument using compactness 
and still succeed in showing that the entanglement is bounded. Using these results for tripartite systems, we are then 
able to show that for a line of qudits, for a wavefunction which is non-negative in the computational basis and which 
has zero correlation length, there is a “coherent Gibbs state” form for the wavefunction. That is, in the computational 
basis the wavefunction is the Gibbs state of some approximately local Hamiltonian. We discuss ways in which our 
results might possibly be improved to show such a form in two dimensions. This would then hopefully be a step 
toward answering this question about intrinsic sign problem for the double semion state as it may be possible to 
classify topological order in such coherent Gibbs states. 

Additionally we give some other results on possible entanglement after measurement in systems without a sign 
constraint. Also, we briefly sketch the absence of a generalized area law (in the sense of Ref. Ej) for Hamiltonians with 
no sign problem. The motivation for showing that was that I believe that many problems in quantum many-body 
physics will simplify if the Hamiltonian has no sign problem so it is worth showing that they do not simplify to such 
an extent that the generalized area law holds. 


1. INTRINSIC SIGN PROBLEM FOR COMMUTING HAMILTONIANS IN THE SAME PHASE AS 

THE DOUBLE SEMION 

We now give the proof that there is an intrinsic sign problem for all commuting Hamiltonians with property TQO-2P 
in the same phase as the double semion model. The proof works for any topology, including spherical. It is natural to 
conjecture that a stronger result holds, namely that if there is a commuting Hamiltonian with property TQO-2 and 
no sign problem, then it is in the same phase as a discrete gauge theory. 

We show that 

Theorem 1.1. Consider a system containing a disk D of radius L (the topology of the system is not important). 
Then for any R sufficiently small compared to L there does not exist a Hamiltonian H' such that 

1. H' is a sum of commuting projectors, each supported on a disk of radius at most R, 

2. H' has property TQO-2 up to some length L* > L (see Ref. [7] and also defined below), 

3. there is a local unitary quantum circuit U with range at most R which maps the ground state subspace of H onto 
the ground state subspace of H', where H is the double semion model Hamiltonian, and 

f. H' has no sign problem. 

Here we define the “range” of a quantum circuit as follows: consider a circuit which involves r rounds of pairwise 
disjoint unitary gates, such that in each round each gate is supported on a set of diameter at most s; then, the range 
is rs. Note that for any operator O , conjugation by a quantum circuit of range R gives an operator supported on the 
set of points within distance R of the support of O. 

The proof is by contradiction. We suppose such that an H' and such a U did exist, and then derive a contradiction 
from lemmas |1.3|1.5|1.6| Lemma |1.3| shows the existence of operators with ceertain properties assuming such an 
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H' exists. One of these properties is that the matrix elements of these operators (in the computational basis) are 
non-negative; in this case we say that the operator is “entrywise non-negative”. Lemma 1.5 shows that a certain 


expectation value of these operators vanishes by showing that is proportional to the trace of the cube of the S matrix 
which vanishes for the double semion model. Lemma |1.6| uses the entrywise non-negativity to show that the same 
expectation value is non-vanishing, giving the desired contradiction. 

The expectation value used is related to ideas Ref. j9] In particular, we use an idea of computing elements of the S 
matrix by expectation values of what is called a “twist product” of operators (explained below). We generalize this 
twist product to multiple operators, but we continue to refer to it as a twist product. 

First, some notation. Write H' = Y/ Y Q ' x , where the sum is over sets X with diameter at most R and each Q' x is 
a projector supported on set X. Let P' x = 1 — Q' x . In general, for any set Y, we define 


p\y) = n p'x- 

IC Y 


(i) 


Similarly, write H = Yx Q x an d ^ P x = 1 ~ Q x and define P(Y) similarly. 

We now define TQO-2 for a Hamiltonian H' which is a sum of commuting projectors. Such a Hamiltonian H' has 
property TQO-2 up to some length scale L* if the following holds for all contractible sets C with diameter less than 
L*. Let Pg project onto the ground state subspace of H'. Let C\ be the set of points within distance R of C (thus, if 
X has diameter R and A' intersects C, then X C Ci). Then, the reduced density matrix of P'(C\) on C equals the 
reduced density matrix of Pg on C, up to multiplication by a scalar (the multiplication by a scalar is necessary since 
we have not normalized the traces of P'(Ci), Pg to 1). 

We note that in Ref. [7J TQO-2 was defined to be the weaker property that the kernel of the reduced density matrix 
of IIx xrP'x on G a g ree d with the kernel of the reduced density matrix of Pg on C. However, as shown in Ref. |5j 
this weaker property (that the kernels agree) implies that the reduced density matrix of P'(C 2 ) on C agrees with the 
reduced density matrix of Pg on C up to multiplication by a scalar, if C 2 is defined to be the set of points within 
distance 2 R of C. So, at the cost of increasing the diameter of X from R to 2P, we can assume the stronger TQO-2 
property; i.e, the results shown here assuming the TQO-2 property defined here will imply the same results for a 
theory with the weaker TQO-2 property up to a redefinition of R. 

Note that all the elements of P' x are non-negative; if this holds for an operator, we say that the operator is “entrywise 
non-negative”. Note also that the product of any two entrywise non-negative operators is entrywise non-negative. 
We will say that some set is “far” from from some other set if the distance d between them is large compared to R. 
That is, there will be some minimum bound on d/R ; this minimum bound will depend upon some geometric details; 
that is, we will often want it to be the case that if one takes some operator and acts on it by the quantum circuit 
it remains inside some set, but since we will have several sets that need to be far from each other, the distances will 
add. Similarly, we will say that a distance d is “large” if it is large compared to P, again with the minimum bound 
on d/R depending on geometric details. We will say something is “near” or “small” if it is not “far” or “large”. 

Let Wi be the loop operators of the double semion theory corresponding to dragging an anyon i around a given 
closed loop, with i ranging over anyon types in the theory (in this case, these are 1,6,6,66). These operators are 
unitary operators which commute with the Hamiltonian. These operators can be read off from Eq. 44 of Ref. |4| with 
W\ . W 2 , IF3, W 4 of that paper corresponding to Bq, H4, Hq, , W bl) . We will also define operators 


W[ = UW.JJK 


( 2 ) 


We also need the following lemma which relies on TQO-2: 

Lemma 1.2. Suppose H' exists obeying properties 1 — 3 of theorem \l . 1\ Consider any set Y. Let Y 2 be the set of points 
within distance 2 R of Y. Assume Y 2 is contractible and has diameter smaller than L*. Then, U'P'fYfjU < P{Y). 
That is, every vector in the image ofU'P'(Y 2 )U is in the image of P(Y). The same statement also holds if both Px 
and P' x are interchanged everywhere and U is interchanged with . 

Proof. We use the TQO-2 assumption (the Levin-Wen Hamiltonian obeys TQO-2 and H' is assumed to have TQO-2). 
Let Y-\ be the the set of points within distance R of Y. By TQO-2, every state in the image of P'(Y 2 ) has a reduced 
density matrix on Yj which agrees with the reduced density matrix of the ground state of H' on that set. Note that 
UP(Y)W is supported on Yj, and so the expectation value of UP(Y)W on any state in the image of P'(Y 2 ) agrees 
with the expectation value of UP(Y)W in a ground state of H '. Therefore, the expectation value of P(Y) on any 
state in the image of WP'(Y 2 )U agrees with the expectation value of P(Y) on a ground state of H, and so is equal 
to 1. □ 


We now construct the needed operators. These operators will be supported on an annulus. We do this construction 
for three different annuli, overlapping as in Fig.[2]or in Fig. [3} the crossings in those figures describes particular “twist 
products” defined below. 
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Lemma 1.3. Consider a system containing a disk D of radius L (the topology of the system is not important). 
Suppose H' exists obeying properties 1 — 3 of theorem 

Let A be any annulus with large inner diameter and large distance between inner and outer boundary. There exists 
an operator La supported on A such that 


L A = L° A xJ2 W(, 


( 3 ) 


where L a A is a non-vanishing operator supported on A, where L A commutes with P' x for all X, and where the sum is 
over i in {1,6, 6, bb}. where the loop defining W[ is chosen to be far from the edges of the annulus, winding around 
the annulus once. Finally. JT W( commutes with L° A . 

Further, if FT has no sign problem, then L A is entrywise non-negative. 

Further we show that L A is independent of the choice of loop so long as the loop is sufficiently far from the edges 
of the annulus. 

Proof. Let B be the set of points in A which are distance greater than 2 R from the boundaries of A. Let C be the 
set of points within distance R of A. So, B is a thinner annulus and C is a thicker annulus. Let T be the set of point 
in A within distance 4 R of the boundary of A. 

We define L A as follows. Let p' B be the reduced density matrix of a ground state of H' on B. Note that since 
Ac D, the reduced density matrix is independent of the choice of ground state if there is a degeneracy. Then, let 


L A = P'(T)p' B P'(T). 


( 4 ) 


Since P' x and p' B are entrywise non-negative, L A is entrywise non-negative. 

So, we now show Eq. Let p A be the reduced density matrix of a ground state of H on A. Since U has range 
R, the trace tr (ObP b ) for any operator Ob supported in B is equal to tr {WObU p A ). However, 


PA = tr C \ A (const, x P(C) x ^ W;Y 


( 5 ) 


as follows from known properties of the double semion model (we omit a proof). Note that the loop for Wi can be 
taken to be any loop sufficiently far from the boundary. 

Hence, since U has range R so that U^ObU C C, the trace tr (Obp' b ) is equal to const, x tr (u^ ObU P(C)(f^2, i . 

Hence, 

p' B = const, x F(uP(C)C^2Wi)U^, (6) 

i 

where T is the super-operator which traces out all sites not in B. Write F as 

J-(O) = J dVVOV\ (7) 

where the integral is over unitaries V supported on the complement of B , using Haar measure for the integration. So, 

p' B = const, x f dV VUP(C)(^ W i )t/ t V rt (8) 

i 

= const, x (£ UWiU ] ) J d VVUP(C)U ] V ] 

= (^UWiU^MB, 

i 

where we use the fact that [U(Y) i Wf)W,V\ = 0 as follows from the assumption that W r is supported far from the 
boundaries of A , and where M B is some operator supported on B defined by the above equation. 

So, 


L A = P\T)M b P\T)(J 2 UWiU f ). 


( 9 ) 
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Here we use the fact that UWiU ' has disjoint support from T, so can be commuted through P'(T ) (we know that Wi 
commutes with Px but we do not know that UWiU' commutes with P ' x ). 

Define L° A = P’(T)M B P'(T). This operator is clearly non-zero. Further, it commutes with JA W( since JA W\ 
commutes with M B . 

We finally need to show that La commutes with all P x . Clearly it commutes with all P' x for X C T. Also, clearly 
it commutes with all P x for X <£_ A. So, we consider its commutator with P x for AT in A but not in T. To show 
that this commutator vanishes, we show that for any such X, M B P X = M B = P X M B . We show M B P X = M B (the 
direction P' X M B = M B is similar). So, we must show that 

[ dVVUP{C)UW* l P' x = [ dVVUP{C)UWl (10) 


However, P x commutes with V since the supports are disjoint. So, we want to show 

UP(C)U^P' X = UP{C)U t. 

This however follows from lemma 11.21 


( 11 ) 

□ 


We now generalize the twist product of Ref. [5 In general, consider any set of operators Oi, 0 2 , 0 3 , ■■■, Ok- Consider 
any sets of sites ■■■,R n with R a n Rb = 0 if a ^ b and such that every Oi is supported on U a R a - Choose n 

permutations, nj, for j = 1 each being a permutation on k elements. From this, define the “twist product of 

Oi,...,Ok with orderings iri,...,n n on regions Ri,...,R n ” as follows. For each operator O i: write = '^ Ja Y\ a Of a , 
where the sum is over sum index a and the product is over a = 1, ...,n with Of a supported on R a . Then, the twist 
product is defined to be 


E 

a(l),a(2 ),...,a(fc) 


nKsff’ 


n a(7r a (2)) 
U ir a (2 ),a 


a 


...o 


“KW)) 

7r a (fe),a J * 


That is, we take the product of operators O i; however on each region a the operators are ordered by the permutation 
7r a . Here, 7r a (b) denotes permutation 7r a applied to b. 

This definition is independent of the choice of decomposition Oi = II a Of a- One wa y 1° see this is t° consider 
two different decompositions of a given operator. To show that they give the same result, it suffices to consider their 
difference which gives some decomposition of the zero operator. So, we need to show that if Oi = 0 for some i, 
then the twist product vanishes. For each choice of a(l), a(2),..., a(k) in the sum, the twist product is a product 
of k different operators, these operators being Jin f° r j = 1 Let us “pad” this product with k extra 

identity operators in front and back, writing the twist product as a product of 3 k operators: first, k factors of /, then 
Ila f° r 3 = 1) •••) an d finally k more factors of I. We say that there are 3 k “time slices” and in each region 

on each time slice there appears an operator either I or . Then, in any given region, we can change from 

having k factors of /, followed by for j = 1 ,..,k, and then k more factors of I, to having k\ factors of /, 

followed by 0“^“^'* for j = 1, k, and then &2 more factors of I with k\ + k 2 = 2 k. In this way, by changing how 

many factors of I appear before and after, we can shift it so that in some time slice we have 0“^ for all i. Then, 
summing over a, in this time slice we have Oi = 0 giving zero for the twist product. 

It is also worth noting that the twist product can be written in a matter that does not use decompositions. See 
for example Fig. [l] which shows the twist product of two operators written as a tensor network. Each square box 
represents an operator. The lines incoming at the bottom of the box representing the state incoming to the operator 
(i.e., the column index of the operator when it is written as a matrix) and the lines outgoing from the top represent 
the state outgoing from the operator (i.e., the row index). There are two lines as we have decomposed the Hilbert 
space as a tensor product of two Hilbert spaces, one on R\ and one on R 2 ; the left line represents one region and the 
right line represents another. 

To denote the orderings in a twist product, it is useful to use pictures, as in Ref. HJ For twist products with three 
or more operators, we can still use pictures. A figure such as Fig. [2] or Fig. [3] is used to indicate a twist product of 
three operators. Each annulus indicates an operator supported on that annulus. The over and under crossings are 
used to indicate orderings on regions where two operators overlap; in more general products where one might have 
three or more operators supported on a given site, one would again denote this with under and over crossings. So, 
on any site which in only one annulus, the ordering of the operators does not matter since the other operators act as 
the identity on that site. On any site in two annuli, we order the operators so that the operator on the undercrossing 
acts first; i.e., the ordering of the operators progresses vertically. 

One useful property of the decomposition of operators used in the twist product is that 


FIG. 1: Illustration of twist product written as tensor network. 



FIG. 2: A twist product of three operators on the three annuli. The over and under crossings indicate the orderings. This 
figure will be used below to compute the trace of S 3 , where S is the S matrix. 


Lemma 1.4. Suppose an operator O supported on R\ U ... U R n commutes with every term P' x in the Hamiltonian. 
Then O can he decomposed as Ila such that each Of commutes with every P' x except possibly for those X that 
have non-vanishing intersection with both R a and some R\, for b ^ a. 

Proof. Suppose first that there are only two regions 1,2. We decompose O using a singular value decomposition as 
O = A a J} a Of where the A a are scalars (using a Hilbert-Schmidt inner product to define the decomposition so 
that tr((0“)l0f) = tr^Of^Oj) = where 6 is the Kronecker delta). In this way, every operator Of is equal to 
tiR 2 (OOf) up to a scalar and similarly every Of is equal to tr ^(OOf). 



FIG. 3: A twist product of three operators on the three annuli. The over and under crossings indicate the orderings. This 
figure computes the trace of S 2 S , where S is the S matrix. This figure is not used in the proof, but is merely used to illustrate 
that twist products different from Fig. [2] can be used to compute other properties of the S matrix. 
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If there are multiple regions, we use an iterated singular value decomposition: if there are regions R\, ...,R n then 
we first do a singular value decomposition of O between Ri and R 2 UR 3 U.. .UR n . We then decompose every operator 
on i ?2 U J ?3 U .... U R n using a further singular value decomposition as a sum of a product of an operator on R 2 and 
an operator on R 3 U ... U R n . We then absorb the scalars (such as A a ) that result from this decomposition into a 
redefinition of the operators in the singular value decomposition. 

In this way, this gives a decomposition of O as O = )C 0: J| a 0“ such that for every there is an operator W“ 
supported on supp(O) \ R a such that 


Oa =tr supp( 0 )\fl 0 (OW“), 


( 12 ) 


where supp(O) denotes the support of an operator O. 

If X has vanishing intersection with R a , then clearly [Of,P' x ] = 0. 
intersection with Rb for all b 7 ^ a, then Eq. (12) shows that [0“,Pj<-] = 0. 


On the other hand, 


if X has vanishing 
□ 


Lemma 1.5. Consider a system containing a disk D of radius L (the topology of the system is not important). 
Suppose H' exists obeying properties 1 — 3 of theorem \l.l\ Consider operators La 2 , La 2 , La 3 constructed as in lemma 
m Then, the expectation value of the twist product in Fig. [2] is equal to zero. 

Proof. The proof is based on showing that the expectation value is proportional to the trace of the cube of the 
topological S matrix of the double semion model, which is equal to zero. In this lemma we do not use the positivity 
properties at all. 

We introduce some notation, writing IT', , W' i2 , W' 3 to denote the operators W( appearing in the constructions of 
La 1 , La 2 , La 3 , respectively. Note that 

L Ac = (1/4)L Ac ^IT' c , (13) 


for c = 1, 2, 3. So, we will instead compute the twist product of three operators (l/4)L^ c JU W' c . 

We now give a decomposition to compute the twist product. Note that if given operators M, N with decompositions 
M = Y, 0 U a M() andN = E„n„ IV“, then we have a decomposition of MN as 

mn =y.h m « n ». ( i4 ) 

a.,/3 a 


Using this identity, we separately decompose La„ and )Tb W[ a . We choose each region to contain at most one crossing 
and to contain the support of at most two of the operators LA a ■ Consider a single crossing as in Fig. [4] We say that 
this crossing contains two “arcs”, where an arc refers to some thickened line which is part of an annulus. Suppose 

La 3 > La ; 

be the terms in the decomposition on this region. From lemma 


are the two operators in this crossing. We choose a region 1Z to contain this crossing. Let L[\ ^ and \ n 


1.4 


we can choose these terms so that L ^ K 


and 


L[\ 2 n commute with all P’ x except those near the four endpoints of the two arcs. We also need some decomposition 
of W[ c . We will separately compute 4 3 = 64 twist products corresponding to twist products of the operators La 1 W' 1 , 


La 2 W) 2 , an d La 3 W( 3 and then sum over i,j, k. 




Note that W' c commutes with UPxU f for all X , so we can choose a decomposition so that every term in the 
decomposition commutes with UPxU ' for all X except those near the end of an arc. Let (W } ' c K )“ be the terms in 
this decomposition. 

Suppose further that in this crossing that A 1 overcrosses and A 2 undercrosses. Then, the ordering of the operators 
in the crossing will be 


n) 0 L\ 2 , n (W' t2tU ) s . 


(Note that i need not equal j here as i and j are being summed over). 

We now consider the action of L\ 2 n (K 2 K ) s on the ket \ijj) where ip' is a ground state of H'. Note that ip' = Uip 
for some ground state ip of H. So, we will instead consider the action of WL\ 2 n U and U^ {W'j 2n ) s U on ip. The 
action of U^ (W' 2n ) s U on a ground state of H leaves the expectation value of all Px equal to 1, except possibly for 
those X near the ends of the arc. In fact, we can decompose this operator W(W’ 12 n ) 5 U as 


U\W'^ n ) 5 U = E j D j (2,TZ), 


(15) 


where Ej acts near the endpoints of the arcs and Dj( 2, TZ) drags an anyon of type j along the path in the arc A 2 C\1Z ; we 
write the 2, TZ in parenthesis as we will apply similar decompositions to other operators and so we will need notation to 
indicate along which arc the anyon is dragged. Here the operator D m that drags an anyon of type m along some open 
loop is obtained from the definition of the loop operators W m in Ref. 0]by considering an open loop instead of a closed 


loop. This decomposition (15) can be shown as follows. The result just uses the fact that U is a local quantum circuit 


and that the operator W. t in Ref. 0] can be written as a product of commuting operators, each supported on a small 
set, that is, we can write Wj . 2 = fix O(x) for some operators 0 {m) all supported on small sets with [ 0 (m), 0 {n)\ = 0 


(we briefly explain this product at the end of this paragraph). So, W- 2 


= UWj, 2 W = UAUO(x)W). Now, we 
decompose the operator Wj 2 as a sum of products of operators supported on the different regions. Some of the 
operators UO(x)W will be supported on 7 Z; let the product of these operators be some operator 0(TZ). Some other 
operators UOW are supported on the complement of 1 Z ; let the product of these operators be some other operator 
0(TZ). Finally, there are some operators UO(x)U t supported both on 1Z and its complement; the product of these 
operators can be decomposed as a sum of products O a (lZ)O a (lZ) acting on 7 Z and its complement. So, this gives 

us a decomposition of UO(x)U 't as Y^ a (o a (TZ)0(TZ) S j (o a {TZ)0{TZ) s J . Then, 


U^O a (TZ)0(TZ)U = (u^O a {TZ)U^j (u^O(TZ)U^j. 


The second term in this product is the product of O(x) for x such that UO(x)U * is supported on 1Z\ being a product 
of the operators 0[x) it is the operator that drags an anyon along an open loop and is the term Dj in Eq. (15) while 
the first term is supported near the endpoints of the arcs and is the term Ej. Finally, we show how to write the 
operator W.; given in Ref. 0] as a product of commuting operators. The operator is given a product of operators on 
edges, “R-legs”, and “L-vertices” (to use the terminology of Ref. 0]) . This product can be written as a product of 
commuting operators on small sets as follows; consider for example the operator W 2 using the notation of Ref. [4} this 
is the operator that drags an anyon b. This operator is 


11 -i II 

edges R—legs 


n (-!)' 


L—vertices 


where s/ is defined in the reference. One can only have at most 6 L-vertices consecutively in the loop (after 6 L- 
vertices, the loop closes), so after 6 L-vertices either the loop must close or there must be an R-leg (R and L refer to 
different directions in which the loop turns). Call a sequence of consecutive L-vertices an “L arc”. We can then write 
the product above as a product 

n ( n < -1 )" n «?) n ^ n 

Lares L—verticesGL arc edgesELarc R—legs edges between R—legs 


where the product for each L arc is over L-vertices in the given arc and over edges which touch at least one L-vertex 
in the arc. The product over edges between i?-legs is over edges which touch two R-legs. Then, this gives W 2 as a 
product of commuting operators; W 3 is similar, while ITj, IT 4 are much easier to decompose. 

Now consider the action of L\^ K on the resulting state. This operator leaves all expectation values P' x unchanged 
except near the ends of the arc. Hence, by lemma 


1.2 


the action of WL^ n U leaves unchanged all Px except near 
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the ends of the arc. We now use a property of the double semion model, which we discuss further at the end of 
the section. Consider any operator O (such as IP L\ n U in this case) which is supported on some contractible set 
supp(O) (in this case, the arc) and which commutes with all terms in the Hamiltonian except those intersecting some 
other set e\ U e 2 where ei, e 2 are both contractible sets (in this case, ei, e 2 will be the endpoints of the arc). Consider 
any state <f> which is a ground state of all terms in the Hamiltonian near supp(O), except possibly those near e\ U e 2 
(for example, the state EjDj(2,TV)il) is such a state). Then, the state 0(j) is equal to '}Z m F m D m (2 1 TV)(j) for some 
operators F m , D m , where the sum is over anyon types, where F m is supported near e\ U ei and where D m (2, TV) drags 
an anyon of type m from e\ to e 2 - 

Similarly, considering the action of W K U and W(W' x K ) l3 U on the bra These can also be written as 
GiDi( 1, TV) and FTiDi( 1, TV), where Di drags an anyon of type i along the arc containing A\C\TZ and Gi and Hi are 
supported near the ends of the arc. So, we can replace these terms in the decomposition in this region with the new 
product of terms J2 l GiDi(l,TV)HiD i (l,TV)'^2 m FkD m (2,TV)EjDj{2,TV). In fact, we can make such a replacement 
in all regions, since once one acts with terms (such as J2 m F kD m (2,TV)EjDj(2,TV)) in some region on the ket, the 
state still is a ground state of all terms in all other regions, except possibly those near the boundary of those regions. 
Having done this replacement, we have a new expectation value to compute. For the product of operators in this 
expectation value, we have the identity 


Y, Gln ) H M 1, ft) F m D U 2, TV)E j D j (2, TV) = ± Y F m D m { 2, TV)EjD^2, TV) Y AA(1, ft) AA(1, ft), 


(16) 

where the sign depends upon the anyon types i,j, k, l (one must commute the dragging operators through each other). 

We write the original twist product as a sum over 4 6 terms, corresponding to the choices of i,j,k in W[ x , IT' 2 , and 
fT(. 3 , and to the choices of l,m,n in the operators which drag anyons in the decompositions of La 11 La 21 La 3 \ one 
might worry that an operator such as La, might drag an anyon of one type along one arc in the decomposition, but 
of another type along another arc. However, this gives zero contribution to the expectation value, since otherwise the 
operator produces a net topological charge at the end of an arc. 

Now, fix l, m, n and sum over i, j, k. Using Eq. (161, we re-order the terms in the twist product until the annuli are 


no longer linked with each other. This requires changing the ordering at three of the crossings. Note that the reason 
we consider the action of the terms on the ground state and show that we can replace them with equivalent terms 
is that we have limited knowledge about the form of the operators La ; however, once we show that their action on 
the ground state can be represented using the operators which drag anyons then we have an explicit form for those 
operators and can compute the commutators exactly. 

We claim that the sum over i,j,k vanishes for any l,m,n. To see this, note that after re-ordering, we can combine 
all the terms in the decomposition of IT iQ back into the original operator IT,;. ra . Hence, up to signs, what we get is 
a product of operators, with each operator in the product either acting near the end points of the arcs, or dragging 
anyons of types l,m,n along the arc, or dragging an anyon of type i,j,k around a loop. Let i! be the anyon type 
given by fusing i, l. Similarly, let j', k' be the anyon types given by fusing j, to or k , n, respectively. For fixed l, to, n, 
the sum over i,j,k causes the anyon type i',j',k' in the fused operator to range over all four possibilities; that is, if 
we fix l, to, n and sum over over i' , j', k' , this is the same as fixing l, to, n and summing over i, j, k. 

The expectation value of this product of operators is independent of i,j,k. Consider any set e which is the 
endpoint of some arc. Note that after recombining the terms in the decomposition of Wi <a back into fT ja , we are 
left with only two possible operators which might not commute with terms Px for X near e. These are terms from 
the two arcs ending near that endpoint, such as F m D m (2, TV) and a similar term F' m D m (2,TZ!) from the arc 
which is the intersection of A 2 with some other region i? 2 - The only other operator acting near the endpoints e 
is an operator Wj iQ which commutes with all terms in H. This operator IT,;. a occurs either before or after both 
terms J2m FmD m (2, TV), F! m D m {2,TZ') so the terms F m D m ( 2, ft), F! m D m {2,TV) appear between a bra and a ket 
which both have expectation value +1 for all Px near e. So, we can replace any operator O in F m.D m (2, TV) or 
F^Dm^TZ') which is supported near some endpoint e by the operator dlx.xnsuppfO)## p x) o (nA^xn S u PP (O)#0 p x) 
and obtain the same expectation value. The resulting operator, however, acts trivially on the ground state; then, 
since the Wi_ a also act trivially on the ground state, the result is independent of i,j, k. 

Hence, the only effect of i,j,k is to give an overall sign due to the sign in Eq. ( [16] ) . The sign of Eq. ( fToj ) for 
commuting i,l through j,m is the same as that for commuting i' through j', and similarly for the other signs. So 
the net sign depends only on the choice of i',j',k! and after summing over i',j',k' it gives the trace of S 3 which 
vanishes. D 


Finally, 
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Lemma 1.6. Suppose the operators La x , La 2 , La 3 in the twist product in Fig. [1| are entrywise non-negative and 
commute with all P' x for some Hamiltonian H' x which obeys TQO-2 and has no sign problem. Then, the twist 
product is positive. 


Proof. Note that for an operator that is entrywise non-negative and that acts on some set of disjoint regions R a , we 
can choose a decomposition of that operator as a sum of products of operators, with each operator in the product 
acting on only one region and such that every operator in the product is entrywise non-negative. The specific 
decomposition that we choose is as follows. For each region R ai let ipa{i) be a basis of states in the computational 
basis. Let O a (i,j) = \ip a (i))(ip a (j)\. Then, to decompose an operator La, we separately decompose each operator p' B 
or P' x appearing in Eq. Q as a sum of products of operators O a {i,j), and we use these decompositions to give a 
decomposition of La using Eq. (14) as a product of operators. 

We write a basis for the ground state wavefunctions using states that have only non-negative coefficients in the 
computational basis. We will compute the expectation value for any one of these basis states; we call this state \k 0 . 
By the disk axiom (which holds for the double semion model and also for the ground states of H 1 since they are 
obtained by acting with a local quantum circuit on the ground states of the double semion model), once we show that 
this expectation value is positive, this will imply that the expectation value is positive for all ground states. The disk 
axiom is also known as TQO-1 in Ref. ]?; and implies that all ground states have the same reduced density matrix on 
any region of diameter smaller than some L*. 

The state 'ko is a sum of basis states in the computational basis with non-negative coefficients, and we have written 
the twist product as a sum of products of operators O a [i,j) with non-negative coefficients. So, to show that the result 
is positive, it suffices to pick any one basis state 4/ in the computational basis and any particular choice of operators 
O a (i,j) appearing in the twist product and show that the expectation value of that choice of operators in state 'k is 
nonzero. We pick \k to be any arbitrary computational basis state that has nonzero amplitude in the given ground 
state wavefunction 'kg; let p be the square of this amplitude. So, the expectation value of the twist product in state 
'kg is greater than or equal to p times the expectation value of the twist product in the basis state *k, which in turn 
is lower bounded by p times the expectation value in the basis state *k of any particular term in the sum in this 
decomposition of the twist product. 

The terms in the decomposition that we choose are the diagonal terms, of the form O a (i,i); that is, we consider 
only the diagonal terms in the decomposition of each operator, either P' x or p' B . So, we must show that for each P' x 
or p ' B , that ('klPyl’k) > 0 and ('k|// B |'k) > 0. This follows for P' x because 'k has positive overlap with >ko which is 
an eigenstate of P' x with eigenvalue +1. It follows for p' B since p' B is the reduced density matrix of fkg which is equal 
to the reduced density matrix of p|'k)(\k| plus non-negative terms arising from contributions of other computational 
basis states in the wavefunction *kg. □ 


Finally, we explain the claim that in the double semion model, given any operator O which is supported on some 
contractible set supp(O) and which commutes with all terms in the Hamiltonian except those intersecting some other 
set ei U e- 2 . where e\, e 2 are both contractible sets, and given any state (f> which is a ground state of all terms in the 
Hamiltonian near supp(O), except possibly those near eiUe 2 , the state 0(f is equal to E m D m (f> for some operators 
E m , Dm, where the sum is over anyon types, where E m is supported near e\ U e 2 and where D m drags an anyon of 
type m from e± to e 2 - Consider the space of ground states of the Hamiltonian which is a sum of Qx over all X that 
are near supp(O), except possibly those near e\ U e 2 - Let us say that the “far boundary” is the set of points far from 
supp(O), and let us say that the boundary is the union of the far boundary and ei,e 2 - Consider any configuration of 
closed loops in the Z basis, except that the loops may be open on e\ or e 2 or far from supp(O), i.e., they may be open 
on the boundary. Acting on such a configuration with II Ynearsupp(o) xn(eiUe 2 )=0 §i yes a ground state if it gives 
a nonzero state, and all ground states can be obtained in this fashion because the projectors Px will annihilate any 
state with an open loop. Any two states with the same configuration on the boundary and with the configuration in 
the same relative Z 2 homology class (relative to boundary) give the same ground state. This is a known result for the 
toric code® the double semion model has the same ground states up to a sign given by —1 raised to a power equal 
to the number of surgery moves required to turn a loop configuration into an arbitrary reference configuration^ on a 
sphere, this power is simply equal to —1 to the number of closed loops while with boundaries it is more complicated 
(on some non-orientable geometries, some of the toric code ground states cannot be given a sign structure consistent 
with the double semion HamiltoniarP-; while this does not happen here, even if it did happen, it would not affect the 
claim that all ground states could be described in this way, as then the ground states of the double semion model 
would simply be given by taking some subset of these toric code states, i.e., those which can be given an appropriate 
sign structure). So, this gives us a basis for the ground state subspace, specified by where ends of open loops are 
located on the boundary and by homology class, if any. 

The usefulness of this basis is that we can now show that we can act arbitrarily in this subspace by dragging anyons 
as follows. One can detect open loops at the boundary using the operator which measures in the Z basis at the 
boundary; we can regard this operator as dragging a bb anyon around a very small open loop containing a single 
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bond. Thus, one can detect the state at the boundary by dragging anyons near the boundary. One can change which 
loops are open at the boundary by dragging an anyon of type b from one point on the boundary to another; at both 
endpoints, this has the effect of creating an open loop if there was none, and destroying an open loop if there was 
one. Thus, we can act on the configuration on the boundary arbitrarily by dragging anyons, and these operators all 
preserve this ground state subspace. We have assumed that supp(O) is contractible, but since we have also sets ei, e 2 
which might be “punctures” within supp(O), in which case in fact there might be nontrivial homology, for example if 
the set of points away from the boundary has the topology of a twice punctured disk. For a given choice of open loop 
ends on the boundary, one can change homology class by dragging an anyon b around a loop which; since supp(O) is 
contractible, such loops supported in supp(O) can all be contracted to loops near e\ or e 2 or a sum of loops around 
both. Similarly, one can deform any operator which detects homology supported in supp(O) to an operator dragging 
anyon bb around loops near ei,e 2 - Hence, the action of any operator in the ground state subspace can be written 
as a sum of operators, each of which is a product of operators dragging anyons of various types from one place on 
the boundary to another, or dragging a b around an element of homology, or dragging a bb around loops near ei,e 2 - 
Finally, once we have written an operator as a sum of products of operators dragging anyons, we deform the paths 
along which the anyons are dragged until we have only (for an operator supported on supp(O)), anyons dragged 
near e\ or e 2 or from e± to e 2 - Any operator supported on supp(O) cannot change which loops are open on the far 
boundary, so we can restrict to sums of operators which do not drag b from one point on the boundary to another. 
Further, any operator supported on supp(O) commutes with all operators supported on the far boundary, so they can 
be sensitive only to the parity (even or odd) of open loops on the far boundary; however, the parity of open loops on 
the far boundary corresponds to dragging bb around the far boundary which can be contracted to dragging bb around 
loops near e 2 ,e 2 - So, the operator supported on supp(O) can be written as a sum of operators dragging anyons of 
various types from some point near e± or e 2 to some point near e\ or e 2 - For each term in the sum, we can compute 
the change in topological charge near e±: we compute the total anyon charge dragged from e\ to e 2 ; this is 1,6,6, or 
bb. Then, we can write that term in sum as an operator D m which drags an anyon with the given total anyon charge 
from ei to e 2 multiplied by an operator E m supported near e.\ U e 2 - 


A. Remarks on Computing the S Matrix From the Ground State Wavefunction 


One might wonder whether this proof above could be adapted to prove the stronger result that the ground state of 
the double semion model has an intrinsic sign problem. In the proof above, we used the operators P' x defined from 
H'. Unfortunately, if we are just given some wavefunction To that is obtained by acting with a local quantum circuit 
on a double semion model ground state such that To is non-negative in the computational basis, there does not seem 
to be any easy way to construct a commuting projector Hamiltonian with no sign problem such that To is a ground 
state of that Hamiltonian. Note that given a density matrix p that is entrywise non-negative, the projector Q onto 
the kernel of p may have a sign problem. That is, a positive wavefunction might not have a parent HamiltoniarP® 
without a sign problem. 

Another possibility would be to instead define the operators La to simply equal La = p' B , without the extra 
projectors as in Eq. (§• In this case, it is still possible to prove a lemma like lemma [U6} However, proving a lemma 
like lemma 1.5 become more difficult. The operators p' B do not commute with the Hamiltonian near the boundary 
and so they may create additional anyonic excitations near the boundary. This complicates the calculation of the 
commutation relations of these operators, complicating the calculation of the twist product. It is not clear whether 
or not a result like lemma 1.5 would still continue to hold (perhaps one can at least show that the expectation value 
is small). 

However, this may still give a way of computing the trace of S 3 (or other properties of the S matrix) from the 
ground state wavefunction. Let p' Bi , p' B2 , p'b 3 be reduced density matrices on three annuli as in Fig.pl While it may 
not be true in general, it is reasonable to hope that for physical systems that the expectation value of tne twist product 
of these reduced density matrices is close to the trace of S 3 multiplied by the expectation value of their untwisted 
product (i.e., the expectation value of Pb 1 P I b 2 Pb 2 - This may be accessible using quantum Monte Carlo and may give 
an alternative method to that suggested in Ref. £[§] to extract the S matrix from the ground state wavefunction. 


2. TELEPORTATION 

We next turn to questions of positive wavefunctions, without any assumptions about an underlying Hamiltonian. 
As a preliminary, we consider teleportation in tripartite wavefunctions, before proceeding to larger systems. 

Consider a tripartite system, with parts labelled A,B,C and corresponding Hilbert spaces Consider 

a wavefunction with amplitudes ^>(«,j, &) i n a product basis |z) (g> |j) (g> |fc) with the property that the wavefunction 





12 


is non-negative in the computational basis, i.e. that 

\/i,j,k ip(i,j,k)> 0. (17) 

For notational shorthand, we refer to a wavefunction which is non-negative in the computational basis simply as a 
“non-negative wavefunction”. Assume further that the density matrix pac on AC factors as 

Pac = PA® Pc- (18) 

We now consider the question of “teleportation” in such a wavefunction: after a projective measurement on B 
in the same basis of states | j), what entanglement can be produced between A, C? Note that without the non¬ 
negativity constraint, it possible for A , C to be maximally entangled: consider a system of four qubits, with A 
being the first qubit, C being the fourth qubit, and B comprising the second and third qubits. Consider the state 
(1/2)(100) + 111)) 8 (100) + 111)). Then, measure B in a Bell basis. However, while the wavefunction is non-negative 
as written down, it is not non-negative if we use a Bell basis for B. 

We will show that the non-negativity constraint has important consequences. 


A. Without Positivity Constraint 

We begin, however, by considering the problem of teleportation in the absence of the non-negativity constraint, for 
which wavefunctions such teleportation is possible for some choice of basis on B. Since in this subsection we allow 
arbitrary bases on B for the measurement, we are in fact allowing arbitrary basis rotations on subsystems A,H,C. 
Hence, the state (after rotation) can be written as 


"0 = i>A,B L 8 ^B R ,C, 


(19) 


where the Hilbert space Hb on B is a tensor product of two Hilbert spaces, Hb l Hb r , and 4>a,b l ,'4 , b r ,c are 
wavefunctions on the pairs of Hilbert spaces Ha ®Hb l and Hb r 8 He- In fact, condition (18 1 is equivalent to the 
statement that, up to rotation, condition (19) above holds. 


The two wavefunctions i/a,b l , ^b r ,c are fully parametrized, up to rotations on A,B,C , by their Schmidt coeffi¬ 
cients. Let A(l) > A(2) > ... be the sequence of Schmidt coefficients for i/a,b l and let q(l) > q(2) > ... be the sequence 
of Schmidt coefficients for i/>b r ,c- We will ask, under which conditions is it possible that e to achieve the situation 
that the average entanglement entropy between A and C after measurement is equal to the entanglement entropy 
between A and BC before measurement. Let pa be the reduced density matrix on A and let Pa(j) be the reduced 
density matrix on A conditioned on measurement outcome j. Let measurement outcome j occur with probability P^. 
Since pa is a convex combination of paU ), by convexity of von Neumann entropy, it follows that the entropy of pa 
is greater than or equal to the average entropy of paU)' 


S(p A )>Y. P f S (PA(j)), 


( 20 ) 


where S(...) denotes the von Neumann entropy of a density matrix. Further, equality can only be attained in the case 
that 


Pa = PaU) (21) 

for all j such that P ) s > 0. Hence equality can only be achieved when, for every measurement outcome on B , the 
sequence of Schmidt coefficients between A and C of the resulting state is identical with the sequence A. We will 
assume without loss of generality that pA has no zero eigenvalues. 

We look for conditions under which this is possible, allowing ds to be arbitrarily large (equivalently, we can take 
ds = dAdc and then allow an arbitrary POVM measurement on B rather than a projective measurement). We claim 
that 


Lemma 2.1. For any given pa and pc, there exist states if which obey Eq. (lA) and Eq. (21) if and only if the 


sequence of eigenvalues of pc is majorized by the sequence 1/dA, 1/dA, ••••, 1 /dA, 0 ,..., where there are dA non-zero 
entries and the remaining dc — dA entries are zero. 


Proof. Let us denote this sequence 1/dA, ■ as r. Here, we say that one sequence a majorizes another sequence b if, 
when the entries are sorted in descending order so that the sequences are ai > 02 > 03 > ... and bi > 62 > 63 > ... we 
have ELi > J2k =1 for a11 l - 





13 


We first show this claim in the case that p A is maximally mixed so that all eigenvalues are equal to 1 /cIa- 
Consider first the direction of showing that this condition is necessary. Let pc(j) denote the reduced density 
matrix on C after measurement outcome j, so that pc = Pf Pc{j)- We have that all pc (j) have the same 

eigenspectrum (namely, the sequence r), and hence pc{j) = Ujpc(j = 1 )Uj, for some unitary matrices Uj, so 

pc = Yhj Ps{j)UjPc{j = Since the sequence eigenvalues of pc{j) is equal to r, by Uhlmann’s theorenf®, this 

condition on pc implies that the sequence r majorizes the sequence of eigenvalues of pc, as claimed. 

Now we show that this condition is sufficient. Again by Uhlmann’s theorem, if the sequence r majorizes the sequence 
of eigenvalues of pc, then we can write pc = P( v )pc(v) f° r some sequence P(v) which defines a probability 
distribution, and some sequence of matrices pc{v) which all have eigenspectrum r. Let this sequence have j take one 
of K possible values for some K. We now given a specific state ip which realizes this set of pc(v ) while still obeying 
condition (181. We pick ds = d 2 A K. We label basis states on B by a triple (f, u, v), with 0 < t < d A — 1,0 < u < d A — 1, 
and 1 < v < K. For each v, let pac(v) be a maximally entangled state between A, C such that the reduced density 
matrix on C is pciy). We pick 


V> = 


, dA 

t,u : v 


V p ( v ) z aXa\(^u,v)) b <B> pac(^ ’), 


( 22 ) 


where Za is a diagonal unitary on A with diagonal entries 1, exp(27r/dA), exp(A tt/ d a), ■■■ and Xa is the “shift” 
operator on A: it is a unitary matrix with 0 entries everywhere except for entries 1 on the superdiagonal and in 
the bottom left corner. To verify that this obeys Eq. (18), it suffices to check that for each v. the state ip = 


Y. h uk Zt A X \\{t,U,v)) B < 8 > Pac{ v ) obeys this condition; this can be verified directly. 

We now consider the general case in which the state pa is not maximally mixed. We will proceed by reducing this 
to the previous case. If pa has some number of zero eigenvalues, we restrict to the Hilbert space orthogonal to those 
zero eigenvalues, so we can assume that pa has no zero eigenvalues. Define the state ip by 


= z 1/2 p A 1/2 ip, 


(23) 


where Z is a normalization chosen so that \ip\ 2 = 1. This define a new state ip which obeys the factorization condition 
and on which the reduced density matrix p A is maximally mixed. Further, let p A {j) denote the reduced density 
matrix on A for state ip after measurement outcome j. We have, from Eq. (21), that p A = Pa{j)- Hence, by the 
above results applied to state ip, we find that the density matrix pc is majorized by r. □ 

Finally, we remark that this calculation we have done amounts to a bound on the so-called “entanglement of 
assistance”® for a particular class of mixed states, p A c = PA® Pc- As shown in Ref.|T51 the asymptotic entanglement 
of assistance for such a state (asymptotic in the sense of given many copies of the state) is equal to min(S(p A ), S(pc)), 
while here we see that for a single copy the bound can be smaller. 


B. With Positivity Constraint For A Being A Qubit 

We now consider the case under which the non-negativity constraint is imposed. We first investigate the same 
question as above: when is it possible to achieve the situation that the average entanglement entropy between A and 
C after measurement i s eq ual to the entanglement entropy between A and BC before measurement? So, as above we 
may assume that Eq. (21) holds. Without loss of generality we assume indeed that all P > 0. We again allow ds 
to be arbitrarily large. 

We begin with some generalities before specializing to the case d A = 2 in the lemma at the end. For given outcome 
j, the resulting normalized wavefunction is equal to 


1 


53 —j=^B^ k )\ i ) ® \k) = 53 I*) ® Mi)): 


(24) 


i.k 


where we define 


Viti)} = 53 ^=' i /’(bi:fc)M 


P? 


(25) 


Hence, the condition Eq. (21) is equivalent to 


Mi)M(i)) = (PA)im, 


( 26 ) 
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for all j, where the right-hand side denotes the l,m matrix element of pa- 
From the factorization equation Eq. © we have that 

h(i))(«m(j)| = ( PA)lmPC■ 

0 


( 27 ) 


For a vector such as Vi{j), let |u;(j)|i denote the 1-norm of the vector, namely the sum of the absolute values of 
the entries of the vector. This is equal to J2 k i’i.hj, &). For a matrix p, let \p\i denote the sum of the absolute values 
of the entries of the matrix. We use superscript . 7. to indicate that we regard the matrix as a vector and then apply 
the 1-norm for that vector, to distinguish this from the usual 1-norm of a matrix. 

We will use this |.7.|i norm frequently, so let us pause to motivate it. Suppose we have some non-negative wave- 
function <p on ABC , with given dc ■ Let oac denote the reduced density matrix of <p on AC. Suppose that <tac does 
not factorize as a a <8> cre- Let us try to construct a state ip that will obey the factorization condition. The state ip 
will have a Hilbert space on B that is the tensor product of the Hilbert space of <p on B with a new degree of freedom 
taking dc ! possible values. Let ip have amplitude 

ip(i,(j,n),k) = (28) 


where here the notation (j. tt) denotes a state on B labelled by a pair j = 1,2 and tt defining a permutation, 
and tt (fc) is the permutation applied to the element tt. Then, the reduced density matrix pac of ip is given by 
Pac = 12tt n C^AC'^c 1 , where nc is the operator corresponding to the permutation tt applied to C. This can be 
understood as applying a quantum channel to cfac to obtain pac- This quantum channel has two invariant subspaces. 
One is the one-dimensional space parallel to the vector (1,1,...) with all entries equal; the other is the dc — 1 space 
perpendicular to that. The quantum channel maximally mixes in each subspace. Hence, to verify Eq. (18), it suffices 
to show that 


trc(pAcn(i,i,...)) a pa , 


(29) 


where i ) projects onto the space parallel to the vector (1,1,...). However, the trace of a density matrix with 
LI(i,i,...) is simply proportional to the |.7.|i norm of that density matrix, motivating our study of this object. This 
construction can be understood as follows: we have added some additional degree of freedom to B (taking dc\ possible 
values) and used this degree of freedom to apply a unitary transformation to C, thus trying to make C more mixed to 
help satisfy Eq. (18). However, we choose these unitaries to be positive (and hence permutations) in order to maintain 
positivity. 


From Eq. (27) we find that 


^2Pf\vi(j)\i\v m {j)\i = (, pA)im\pc\i 


(30) 


It is in this step that we used the non-negativity assumption: for any given j the |.7.|i norm of \vi(j))(v m (j)\ is equal 
to |uj(j)|i|‘t>m( 1 7')]i) and because of the non-negativity assumption, the |.7.|i norm of JT Pf\vi(j))(v m (j)\ is simply 
the sum of the norms for each j. 

Without loss of generality, we may assume that \vi(j)\ = |w(j)| = 1 /y/dA for all j. To see this, consider the initial 
state ip(i,j, k). If the probability of having i be in any given state is equal to zero, the problem reduces to a problem 
with dA reduced by 1. So, we may assume that the probability of being in any given state on A is nonzero. Then, we 
consider a new state ip’(i,j,k) given by ip'(i,j,k) = f(i)ip(i,j,k ) where /(*) is some function chosen to ensure that 


ip' has equal probability of each state on A. Then, Eq. (18) for ip is equivalent to Eq. (18) for ip' and the condition 
of equality of entropy of ip is the same as that for ip'. 

For notational simplicity, let Vi(j) = VdAVi{j)- This is just done to normalize all vectors to 1. We have from 
Eqs. ( 27|30 ) that 


j 


(31) 


Note that since Eq. (21) holds, the right-hand side is simply equal to \pc\i(PA)im as {v m (j)\vi(j)) is independent of 
j. In the special case that l = m, Eq. (311 gives that 


l®iC/)li = \Pc\\ 


( 32 ) 
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Hence, Pf (\vi(j)\j + |S m (j)|f) = 2|pc]i and so by Eq. (311, 


, Y:jPf\viU)\i\v m U)\i 
' Y,j P F (\vi(j)\l + \Vm(j)\l 




(33) 


Now, let us consider for which cases Eq. (33) can be satisfied. That is, given a specified value for dc and for 


(vm(i)\yi(j)) (recall that this inner product is independent of j), is it possible to satisfy Eq. (33)? The left-hand side 
of Eq. (33) is a ratio of two sums. By adjusting the probabilities P we will be able to satisfy the equality unless it 


is the case 


that for all j we have that 


First Case : 2 


or it is the case that for all j we have that 


\vi{j)\i\v m (j)U 


\vi(j) I 






Second Case : 2 


l^(j)|l|^m(j)|l 

l^(i)ll + |6m(i)ll 


< {Vrn.(j)\vi(j))- 


(34) 


(35) 


In fact, what we will find is that for sufficiently small dc it may not be possible to satisfy Eq. (33), because 
in fact for small enough dc it might not be possible to find two non-negative ve ctors v,w, with |u| = |ic| = 1 
(u|tu), so that we will always be in the first case above (341. Note, it is always possible to 

' |!>b+H 2 > ^ v \ w ') f° r an y ( v \ w ) < 1 and any dc > 2 (obviously, if 


such that 2p)4(n^w < 

Mi+Mi “ 

find two non-negative vectors v,w such that 2 7 


dc = 1 then there is no way to find two vectors with inner product less than 1). Simply choose v = (1, 0, 0,...) and 
w = (cos(0),sin(0), 0,0, ...0). A direct calculation shows that indeed the inequality holds. 

In general, given a dimension dc and a value (u|u;|), we say that “the pair dc, (v\w) is feasible ” is it is possible to 
achieve 2 < {v\w} for two vectors v,w with non-negative entries. In the appendix, we consider which pairs 

are feasible. We find that for dc = 2, it is not possible; however, for dc > 2, it is possible to achieve this, so long as 
(v\w) is not too small. The possible values of (v\w) tend to zero as dc tends to infinity. 


We finally restrict to qubits and show that the ability to achieve this inequality 2 
restriction, if ds is allowed to be large enough. That is, 


MiMi 
\v\i+\w\i - 


< (u|w) is the only 


Lemma 2.2. Let pa be a density matrix with non-negative entries for A being a qubit. Suppose that pa has equal 
probability to be in the up or down state. Let Xa denote the Pauli X operator on A. For a given dc, there exist a ds 
and a non-negative wavefunction ip on ABC obeying Eqs. 1 18\21 ) if and only if the pair dc,tr(pAXA) is feasible. 

More generally, if p a has probabilities P|,P| / 0 o/ being i n the u p or down state. For a given dc, there exist a 
ds and a non-negative wavefunction ip on ABC obeying Eqs. fillip if and only if the pair 


dc, 


tr (p A X A ) 

2/Prn 


is feasible. 

Proof. The general case can be reduced to the first case by considering a state ip' with amplitudes ip'(i,j,k) = 
(2P i A ) _1 / 2 '0(i, j, k), where P/ 1 is the probability of being in state i on system A. 

Let v(j) = u t (j) and w(j) = v±(j). Now, if Eq. ([21]) holds, then (v{j)\w(j)) = tr (p A X A ). If the pair d c , tx{p A X A ) 
is not feasible, then Eq. (34) holds for all j, and so Eq. (33) cannot be satisfied. That shows the “only if’ direction. 
Now we show the “if’ direction. Suppose the pair dc, tr ( paXa) is feasible. Then, we can choose two pairs of vectors 
u(l),w(l) and v(2),w(2), with all vectors normalized to unity and (ui|roi) = = tr^^X^) such that the pair 

u(l), izj( 1) has 


o Ki)lik(i)li 

Ki)li + Ki)l? 


> («(i)|w(i)) 


(36) 


and 


b(2)|i|w(2)|i 

K2)|f + K2)|? 


> («(2)|«;(2)) 


(37) 
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and find probabilities P B (1),P B (2) such that 


T,j p f {yU)\i + Hi)]? 




(38) 


Eq. (28). Then, our calculations with Eq. (33) simply amount to verifying Eq. (29) 

1 1 norm of that density matrix. 


with II( ljlj ) is simply proportional to the 


So, these probabilities can be used to a give a state with ds = 2 that obeys Eq. (33) and Eq. (21). Call this state <j>. 
However, such a state </> will not, in general, obey Eq. 18). As above, define a state ip which has ds = 2 • dc !, using 

as the trace of a density matrix 

□ 


C. Qudits, Entanglement Decrease 


We now consider the qudit case. In this section, we show that 

Proposition 2.3. Consider any finite dA,dc- Then there is a constant c < 1 such that the following holds. Assume 
Eq. (18) holds. Assume i/>(i, j, k) is non-negative. Then, 


^ Z P J BS (PA{j )) < c-ma x(S(p A ),S(p c )- 


(39) 


We conjecture that this result can be improved to show that the constant c can be chosen independently of dA,dc- 
In the discussion, we explain why this stronger result would be more useful. 


To prove this proposition, we first show a simple result that replaces the inequality < in Eq. (20) with a strict < 
inequality: 


Lemma 2.4. Consider any finite dA,dc- Assume Eq. (18) holds. Assume ip(i,j,k) is non-negative. Suppose that 
S(pa),S(pc) > 0 . Then, 


^PfSiPAti)) < ma,x(S(p A ),S(p c ). 


(40) 


Proof. Suppose not. Let %p be a density matrix such that equality holds with S(pa) = nra x(S(pa), S(pc ))• Then, we 
must have that S(pa) = S(pc)- We first show that pa and pc must both have the property that they are proportional 
to projectors and that they have the same rank; that is, there is some A > 0 such that all eigenvalues of pa and pc 
are either equal to A, or are equal to 0 . To see this, note that equality S(pa) = max(5'(p / i), S(pc)) can only hold 
if pa (J) is independent of j and equal to pa and similarly, pc{j) must also be independent of j and equal to pc- 
Suppose pa has eigenvalues Ai > A 2 > ..., with corresponding eigenvectors V\,V 2 ,--- The matrix pc must have the 
same eigenvalues Ai, A 2 ,... Then, in the eigenbasis of pa and eigenbasis of pc, 


V’C?) = V^U(j) a p\a) A « 


(41) 


where iffj) denotes the pure state on AC conditioned on measurement outcome j and where \a) a, \P)c i n this equation 
denote basis vectors in the eigenbasis of pa, pc- The matrix U(j) is a unitary matrix which commutes with pa', that 
is, Ua is block diagonal in this eigenbasis, only mixing basis vectors with the same eigenvalue. Suppose that pa is 
not a projector. Choose any two distinct nonzero eigenvalues A> > A< > 0. Let n^z) project onto the subspace of 
eigenvectors of pa with eigenvalue x and let nc(x) project onto the subspace of eigenvectors of pc with eigenvalue 
x. Then, we find that 


Tr {{pac - PA < S > Pc)nA(A>)n c (A<)) 7 ^ 0, 


contradicting the assumption Eq. (18); to see that this is nonzero, note that it has the same nonzero value for any 
choice of U. 

So, indeed pa, Pc must both be proportional to projectors and must have the same rank. Recall that pa has 
non-negative entries. We claim that a projector with non-negative entries must be block diagonal, and in each block 
must either be zero or have a single nonzero eigenvector; this eigenvector is a non-negative vector v and the projector 
in that block is the outer product of v with itself. That is, such a projector P can be written as 




P = 


P 2 


Ps 


(42) 
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where each Pa is a block matrix which is either equal to zero, or has matrix elements ( Pi) a b = v a Vb for some non¬ 
negative vector v with norm 1. To see Eq. (42), pick any vector w which is 1 in some coordinate and 0 in all other 


coordinates. Consider Pw. If Pw is zero, then the w-th row and column of P is equal to zero, giving us one of the 
zero blocks of Eq. (42). If Pw is nonzero, then the result is some eigenvector of P with eigenvalue 1. Normalize this 


vector and call it v. Let C be the set of coordinates on which v is nonvanishing. We claim that this gives us one of 
the nonzero blocks of P. To see this, consider any normalized eigenvector of P orthogonal to v. Suppose that this 
eigenvector w is nonvanishing on some coordinates in C. Then, since v and w are orthogonal, w must be nonzero and 
positive on some set of coordinates C+ C C and nonzero and negative on some other set of coordinates C- C C. Then, 
P must have vanishing matrix elements between all coordinates in C + and coordinates in C~ (if not, by replacing 
w with a new vector obtained by replacing all entries of w with their absolute value, we would find a vector w' with 
{w' ,Pw') > 1, which is a contradiction). However, this contradicts our assumption on how C was obtained: P has 
a nonvanishing matrix element between the coordinate in which w is nonzero and every other coordinate in C and 
so P has nonvanishing matrix elements between any pairs of coordinates in C. Here we use a property of projection 
operators with non-negative entries: if matrix element P a b and P& c > 0, then P ac > 0; this follows from the fact that 
P 2 = P. 


So, suppose that (1 /A)pa has the form (42). Let | a) a denote basis vectors for states on A such that 


| a) 


a{u\ = 


/0 


V 


Pv 


\ 


.../ 


(43) 


Let | a)c be similar states on C (note that the matrices P a will mean different things depending on whether we are 
referring to A or C ). Then, if = j p J> P)W)a 8 \j )b 8 \P)c f° r some if(a,j, /3), where in an abuse of notation 
ip(a,j,p) refers to a possibly different function from the original ip(i,j,k). Further, ip{a,j,f 3 ) is still non-negative. 
Both pa and pc are proportional to the identity matrix in this basis, as each block of pa, Pc in the original basis 
becomes a single diagonal entry. By Eq. (18), pa® Pc is also proportional to the identity matrix in this basis. Hence, 


by the non-negativity of if(a,j , /3), it follows that if if(a,j, f3) > 0 for some specific choice of a,j, /? then 3') = 0 

unless a = a 1 and /3 = /?', as if this were not true then pa <8 pc would have non-vanishing off-diagonal entries. Hence, 
after the measurement of j, the states A and C are completely unenentangled, rather than maximally entangled. O 


We now show a result on how the entropy behaves in the case that pa , Pc each have one large eigenvalue and all 
other eigenvalues small. Note that this next lemma does not make use of the nonnegativity assumption. 

Lemma 2.5. For any finite d,A,dc there is an e > 0 and a c < 1 such that the following holds. Assume Eq. | 
holds. Suppose that both pa-, Pc each have the property that all but one of their eigenvalues are less than or equal to 
e. Then, 


Y,P?S(PaU)) < c-max(S{p A ),S{pc). (44) 

j 

Proof. Let the second largest eigenvalue of pa be A2(^4) and let the second largest eigenvalue of pc be A 2 (C). Assume 
without loss of generality that A 2 (A) > A 2 (C). Then, rna x(S(pa), S{pc) > — A 2 (A) log(A 2 (A)). Let be the pure 
state on AC conditioned on outcome j. Let n sma ;/,A denote the projector on A orthogonal to the largest eigenvalue 
of pa and let A srna ii,c denote the projector on C orthogonal to the largest eigenvalue of pc- 
For any given j, let P 3 maii(j ) be given by 

Psmall(j) ~ (VKj) \n s mall,A 8 (1 n small,c') A (f Ll sma /; ; A) 8 ^dsmall ,C \ ( J ^) (^b) 

and let Psmall,smaii{j) be given by 


Psmall,smalli,j ) — (VKj ) \^dsmall,A^d.small,C \ I^(j^)) • (^b) 

That is, Psmall,smaii{j) is the probability that both A, C are in the “small probability” space (the space orthogonal 
to the largest eigenvalue) while P sm aii{j ) is the probability that one of them is in the small probability space. Note 
that 

pf PsmallU) = {if Ismail,A 8(1- H sma ll,c) + (1 ~ A sma ll,A) ®Tlsmall,c\^) < (d/t ~ 1) A 2 ( A) + {dc ~ 1) A 2 (C), (47) 

3 
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and 


Hence, 


where 


Psmall,small(j) < (^A — l)(dc ~ 1) A 2 (A) A 2 (C) . 
3 


E P 3 B S(PaU)) 


= £ p / 


S(pA{j)) 


PsmallU) 

^2(A)-\-\2(C) 


^2(A)\2 (C) 


{ 


Psmall (j ) 


P, 


small,small 


O') 


(A VA 2 (A) + A 2 (C) A 2 (A)A 2 (C) 


< M ■ {{dA - 1) + (dc — 1) + (dA - 1 )(dc — 1)) > 


(48) 


(49) 


M = max^ 


S(pa(<P)) 


PsmaU(<t>) 

^2 (^.) + A2 (C) 


Psmall,small{4 >) 

A 2 (A)A 2 (C) 


(50) 


where the maximum is taken over all pure states 4> on AC , with S(pa{4>)) being the entropy of <f> on A, and P S maii(,<fi ) = 

(0|n small, A 0 (1 n small, o') ~t~ (1 n small, A ) & H small ,c|0): and Psmall,small (0) = 0 H 5ma ^ ) C'|0). The 

term multiplying M in Eq. (49) is a constant, independent of A 2 (A),A 2 (C). We will show that M is bounded by a 
constant (depending on dA,dc) times A 2 (A). Then, since S(pa) > —A 2 (A) log(A 2 (A)), for sufficiently small A 2 (A) the 
logarithm will be arbitrarily larger than the constant in the bound on M and so the lemma will follow for sufficiently 
small e. 

We can reduce this problem of maximizing over (j> to a problem of optimizing over states <j> on a two dimensional 
Hilbert space, because the entropy of any <j> is bounded by a constant (this constants depends upon dA,dc ) times 
the contributions to the entropy summer over the two largest eigenvalues of p. Thus, we can reduce to the Hilbert 
space spanned by the two largest eigenvectors on A (and similarly reduce C). Then, the optimization over such two 
dimensional <f> can be done straightforwardly. □ 


Now we can show Proposition |2.3| Consider a fixed dA,dc■ Consider the set of pairs of density matrices pa, pc 
such that the second largest eigenvalue of pa is greater than or equal to e (where e is chosen from lemma 2.5) or the 
the second largest eigenvalue of pc is greater than or equal to e. Let c\ be the maximum over density matrices in this 
set of 


Ej Pj B S(pa(j)) 
ma x(S(p A ),S(p c )' 


Note that the denominator is bounded away from zero on this set so that C\ is bounded. Since this set is compact, 
the maximum is achieved somewhere on the set. However, this implies that c\ < 1 by lemma 2.4 Choose the c in 
Proposition |2.3| to be the maximum of this c\ and the constant c from lemma [231 


3. ONE-DIMENSIONAL CHAIN 

We now consider a one-dimensional chain. We consider a line of qudits, all with the same Hilbert space dimension 
d. We consider a non-negative pure state on this line such that pab factors as pa ® Pb for any two disjoint regions 
A, B which have at least one qudit in between them. We refer to this as a system with “zero correlation length”. We 
number the qudits by integers, 1, ...,T, where we consider a line of L sites. 

First, we claim that given any pair, i, j with i < j, if we measure qudits i +1, i + 2,..., i + j — 1 in the computational 
basis, then the expected entanglement entropy between 1 ,...,* and after measurement is exponentially small 

in j — i. To see this, first consider any triple i, * + 1, i + 2. There is some given entanglement across the cuts i,i + 1 
and across the cuts i + l,i + 2. This entanglement entropy is at most log(d) across each cut. We wish to estimate the 
expected entanglement across the cut i,i + 2 after measuring the spin i + 1. To apply the results above, one might 
first imagine that we would treat 1, ...,i as system A and spins i + 2, ...,L as system C, while letting B be spin i. 
However, the results above required that A and C both have bounded Hilbert space dimension, and in this case the 
dimension of A, C would depend on L. To get around this difficulty, note that if we measure spin 1,...,* — 1 and 
spins * + 3,..., L then regardless of the measurement outcome, the entanglement across the cuts i,i + 1 and i + 2, i + 3 
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is unchanged. Further, the expected entanglement across i,i + 2 after measuring i + 1 (i.e., expected over different 
measurement outcomes on i + 1) is independent of the measurement outcome on 1, ...,z — 1 and * + 3, Indeed, 
we could simply project onto any state in the computational basis with nonzero amplitude on spins 1, ...,* — 1 and 
i + 3,..., L giving a wavefunction on the three remaining spins i, i + 1, * + 2. 

Then, we can apply the results above and show that the expected entanglement between i,i + 2 for this three site 
system is bounded by clog(d), where c < 1 is from Proposition 2.3 Hence, if we consider the original wavefunction 


(without projecting onto a state on spins 1, ...,* — 1 and i + 3, ...,P) we find that the expected entanglement across 
the cut i,i + 2 is bounded by clog(d). However, we can now repeat the process, considering the entanglement across 
the cut i, i + 3 after measuring spin i + 2. In this way we find that: 


Proposition 3.1. For any one-dimensional line of qudits, labelled 1, ...,P, all with dimension bounded by d, for any 
non-negative wavefunction on this line which has zero correlation length in the sense above, given any pair i,j, with 
i < j, the expected entanglement entropy between spins 1,..., i and spins j ,..., L after measurement of spins i+l ,..., j — 1 
in the computational basis is bounded by 


S(pi,...,i) < c? 1 1 log(d) 


(51) 


for the constant c < 1 from Proposition \2.S\ 

Let us write the wavefunction for this system as ^(oq, ..., oq,) where oq,..., oz, are values of the the spins in the 
computational basis. Let P(cti, ..., ojf) = |\P(cri,..., ctl )| 2 denote the classical probability distribution associated to 
the quantum wavefunction on this system. Given any two subsystems, A,B, the mutual information between the 
associated classical variables is bounded by that between the quantum variables. That is, if A c denotes the classical 
variable obtained by measuring A in the computational basis and B c denotes that obtained by measuring B in the 
computational basis, then I(A C -,B C ) < I(A;B). Applying this to the subsystems 1 ,...,* and of the above 

proposition, we find that the mutual information between (1, ...,i) c and (j, ...,L) C conditioned on (i + 1,..., j — l) c is 
bounded by c 7- ® -1 log(d), where we write (a,..., b) c to denote the classical variables obtained by measuring a, a+1,..., b 
in the computational basis. Hence, 


z) c ;j c |(z + 1,..., j - l) c ) < c 7 1 Mog (d). 


(52) 


So, by Pinsker’s inequality, the one-norm distance between the probability distribution P(cr 1 ,..., aj) and the probability 
distribution P(ai,..., <7j — l)P(<Tj+i,..., <jj)/P(cri+i, ..., j — 1) is bounded by 

\\P{au-,<rj) ~ - l)P(a i+1 ,...,aj)/P(a i+1 ,...,j - 1)|| +1 < log (d). (53) 

Hence, iterating this we find that 


Let 'PGibbs be defined as 


||P(ci,..., cr L ) 


'&Gibbs(o’l, ..., CT L ) = 



•Wi+i) I, 

nf=2 p (°r 

. IIJ- 

&i+l-1) 

Yl-ZlPi.cn, 

•••? &i-Yl) 

\ uUp(^- 

■) &i+l- 1) 


(54) 


(55) 


We refer to this as a “coherent Gibbs state”, since it can be written as 


^ L-l L—l 

^Gibbsio- 1 , ..., <7 l) = exp(- £n log(P(cr i , ..., cq_j-z)) ^ ^ ^ log(P(cq, •••, 

i= 1 i —2 


(56) 


and each term in the exponential can be regarded as some local classical Hamiltonian (with interaction range at most 
l ) and the exponential can then be regarded as a Gibbs state of this Hamiltonian. We have the general bound for any 
two mixed states p, a that are diagonal in the computational basis that 

teiyfpVo) > \\\P- °1Ii ( 57 ) 
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This bound is a special case of a bound in Ref. [20]. 

Applying this bound to the mixed states p,a corresponding to the classical probability distributions P(eri, ...,ctl) 


and 


nt7 .»■ 




, and using Eq. (541 we get that 

+i-i) 


('S’l'i'Gibbs) > ^ \Jc l log (d). 


(58) 


This construction of the coherent Gibbs states may remind the reader of the Hammersley-Clifford theorem, which 
states that for any classical probability distribution for spins on a graph, with the property that the measurement of 
all spins neighboring a given spin i will complete factorize the probability distribution between i and the rest of the 
system, and with the property that the probability distribution is nonzero on all configurations, then the probability 
distribution is the Gibbs state of a Hamiltonian that is a sum of terms supported on cliques of the given graph. Thus, 
it may seem surprising that we do not need to require that the probability distribution is nonzero here, as that is 
known to be a necessary condition 12 for the Hammersley-Clifford theorem. The resolution is that we have considered 
the system on a one-dimensional chain with open boundary conditions, which gives us a natural ordering of the spins, 
allowing us to iteratively build up the distribution on the first spin, then first two spins, and so on, as done above. 
So, the second spin will help “shield” the first spin from the rest of the chain (in our problem, this shielding is not 
perfect in contrast to the Hammersley-Clifford result), while if we had considered the problem with periodic boundary 
conditions, even under the stronger assumptions of Hammersley-Clifford the second spin alone would not shield the 
first spin from the rest of the chain, but we would also need to measure the L-th spin. 


4. DISCUSSION 

We have shown an intrinsic sign problem for commuting Hamiltonians with TQO-2 in the same phase as the double 
semion model. Further, we have shown several bounds on possible teleportation in systems with a non-negativity 
constraint. We have finally shown that the wavefunction has a coherent Gibbs state form for one-dimensional systems 
with zero correlation length. This form is based on showing that given any pairs of qudits i.j. if we measure qudits 
i + 1, i + 2,..., i + j — 1 in the computational basis, then i,j are exponentially close to decorrelated. If Proposition 
|2.3| could be improved to make the constant c independent of dc, then we would be able to prove a similar result 
in two dimensions, in this case showing that given any qudit ?, if we measure all other qudits within radius R of 
i, then i is approximately decorrelated from the rest of the system. This would then hopefully allow a proof of 
an approximate Gibbs state form for two dimensional systems with zero correlation length, a step toward the goal 
mentioned in the introduction of classifying topological order for positive states, although now because the problem 
is in two dimensions, the proof of the coherent Gibbs state would be much more difficult (in particular, we would 
not have such a natural ordering of spins as in one dimension; see the discussion above about shielding one spin from 
the rest of the chain). Thus, while at present the two main results in this paper (intrinsic sign problem for certain 
commuting Hamiltonians in two dimensions, and coherent Gibbs states for certain wavefunctions in one dimension) 
are not related to each other, perhaps if there is an extension of the coherent Gibbs state result to two dimensions 
then this will lead to a connection between these results and to a way of strengthening the first result. 

The general study of wavefunctions with this non-negativity constraint may be interesting. It might, for example, 
be easier to prove area law results for Hamiltonians if we assume that the ground state wavefunction has this property. 
It might be even easier if we assume that every term in the Hamiltonian has the property that all off-diagonal matrix 
elements are non-positive. In this regard, it is worth pointing out that even if the Hamiltonian has this property, there 
still need not be a “generalized area law” in the sense of Ref. [3] That paper used quantum expanders 13 ^ to construct 
a Hamiltonian on four sites on a line, with interactions between only neighboring sites. The dimensions of the middle 
two sites were fixed, but the dimensions of the end sites were parametrized by a parameter d. The Hamiltonian had 
uniformly (in d) upper bounded norm and lower bounded spectral gap, but had large (in fact, maximal, i.e. log(d)) 
entanglement between either end site and the rest of the chain. The Hamiltonian however did not have the property 
of having no sign problem. However, it is not difficult to modify the Hamiltonian so that it does not have a sign 
problem. We spend some time motivating the construction before writing the desired Hamiltonian. A first guess as 
to how to do this (this guess does not work) is to replace the unitary matrices Ui,U 2 ,U 3 which are used to define 
the quantum expander in that work by unitary matrices with non-negative entries. In fact, the only unitary matrices 
which have non-negative entries are permutation matrices, so we would be trying to construct a quantum expander 
defined by a channel of the form 

/c 

£ 0 ) = l^2 P aP P l, 


(59) 
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where in general we allow k (potentially more than 3) unitary matrices. However, such a channel cannot be a quantum 
expander because it has at least two linearly independent fixed points. One such fixed point is the maximally mixed 
state, which the other fixed point is the matrix p with all entries equal to 1/d; i.e., this second fixed point is the 
projector onto a vector with all entries equal to 1/vd. So, we need to modify this solution. Before giving the 
modification, we explicitly write down the Hamiltonian that would result from applying the construction of Ref. [3] to 
this channel. The Hamiltonian is a sum Hl + Hm + -Hr with the middle two sites having k states, with basis vectors 
|z), for i = 1,..., k, and with 


Hl = 1 - l P ' P l ® l^'l ( 6 °) 

* i,i '=1 

k 

H * = l ~\il\ iW\®(PiPl) T ( 61 ) 

i,i '=1 

k 

h m = J2( (62) 

a =2 

To use previous results from Ref. [2H we will take k even, and fix P a = Pl +k / 2 for 1 < a < k/2. First, note that for 
k > 4, for random choice of the permutations, there is only a two dimensional space of fixed points (spanned by the 
two states given above), with a gap to the rest of the spectrum of the map 8 ( p ) . This follows from the fact that the 
classical stochastic map which maps a probability distribution p (the entries of the vector denote the probabilities of 
the various states) by 


a —1 


(63) 


forms a 2-copy tensor product expander 21 ®. The quantum channel defined by (59) has exactly the same action on 


p as this 2-copy tensor product expander does on the joint probability distribution of two degrees of freedom. The 
construction of Ref. [^Quses at least four permutation matrices to do this, so using such permutation matrices requires 
a modification of the construction of Ref. to increase the dimension of the mid dle sites to k = 4 from 3. Hence, 
combining these results from Refs. 131211 the Hamiltonian given by Eqs. ( 60|61|62 1 has a spectral gap with a doubly 
degenerate ground state. This Hamiltonian is frustration free. 

However, this still does not give the desired result as we wish a unique, highly entangled ground state. Note however 
that we can find quantum channels, all of whose Krauss matrices have non-negative entries, such that there is a unique 
fixed point which is the maximally mixed state, with a gap to the rest of the spectrum. Define a projector n which 
projects onto the first d/2 coordinates. Consider the quantum channel 


£(p) = E Pa P p a + (i - <z)i¥n + (l - q)( l - n)p(i - n), (64) 

a =1 

for any q with 0 < q < 1. This has the desired property: by adding the second and third terms, the only fixed point is 
indeed the maximally mixed state, and using the claimed properties of the spectrum of the tensor product expander 
it is easy to show that there is a gap to the rest of the spectrum. 

However, such a quantum channel, which uses non-unitary matrices, does not exactly fit into the scheme of Ref. [3] 
We now write down a Hamiltonian which is motivated by this quantum channel which slightly generalizes this scheme. 
The middle sites will have dimension k + 2, with states labelled 1,2,..., k, A, B. The two states A, B in a sense will in 
a sense correspond to the two additional Krauss operators. The Hamiltonian is given by H' L + H' R + H' M with 

H' l =H l + A l , (65) 


where 


Al = n® (|1> - \A)m - |A»t + (1 — n) ® (|1) - |R))(|1) - IB))t 


(66) 
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and 


H' r = H r + A h, 


where 


(67) 


A* = (|1) - \A)m |A))t 0 n + (|1> - |B))(|1> - |B))t 0 (1 - H), (68) 


H' m = H m + \AB)(AB\ + \BA)(BA\. (69) 

To analyze this Hamiltonian, consider first the Hamiltonian H' L + H' R + Hm- Note that this is equal to Hl + H R + 
Hm + A r + A r . Both H r + H r + Hm is a frustration free Hamiltonian with a spectral gap (it is an easy exercise 
to verify that the addition of the states |A) and | B) increases the number of ground states but does not remove the 
spectral gap), so it is lower bounded by coQo for some constant Co > 0 and some projector Qq. Similarly, + Ar 
is lower bounded by C\Q\ for some C\ > 0 and some projector Q\. Hence, Hl + H R + Al + A R is lower bounded by 
min(co, Ci)(Qq + Qi)- We now use the following lemma about gaps for sums of projectors: 

Lemma 4.1. Let Qo,Qi be projectors. For any positive semi-definite operator O, let A(O) denote the smallest 
non-zero eigenvalue of O. Then, 

A(Q 0 + Qi) > -A^Qi + (1 — Qi)Qo(l — Qi)^j = A^(l — Qi)Qo(l — Qi)^j- (70) 

Proof. Applying Jordan’s lemma to the projectors Qo,Qi, we can bring both projectors to a block diagonal form 
with blocks of size at most 2. We show that this result holds in each block. Consider a block of size 2 in which 
each projector has one eigenvalue equal to zero and one equal to one (the case of a block of size 1 can be handled 
straightforwardly). Pick a basis in this block so that is Qi is 


and Qq being 



f cos(0) 2 cos(d) sin(0)\ 
l^cos^) sin(0) sin 2 (0) j 


(71) 


(72) 


A direct calculation gives that the smallest nonzero eigenvalue of Qq + Q\ in the given block is 1 — | cos(0)|, while the 
smallest nonzero eigenvalue of Q i + (1 — Qi)Qo(l — Qi) in the given block is sin 2 (0). Since 1 — | cos(0)| > ^ sin 2 (d) 
the result follows. _ 


To show this last inequality, note that it is equivalent to 1 
gives the desired result. 


| sin 2 (0) > cos(0) = y 1 — sin 2 (0). Squaring both sides 

□ 


Hence, using this lemma, we can restrict to the range of 1 — Q±, and show a gap for Qq restricted to this subspace. 
The range of 1 — Q\ can be obtained by starting with a space of states including only states 1,..., k on the middle two 
sites and applying an isometry I 0 |?)(+(l/v / 2)n 0 (|1) + |A))(1| + (1/^/2)(1 — n) 0 (|1) + |H))(1| to the first 
two sites and ^*=2 l®)(®7 + (|1) + |A))(1| 0 n + (|1) + |B))(1| 0 (1 — n). Call the product of these isometries V. We 
find that (Hl + H m T 7/ r) V 0 (1 /2] (// / -)- H -\-11 ) where, in an abuse of notation, the 11 / + Hm T H r on the 
left hand side acts on the space of states with k + 2 states on each middle site, while the Hl + Hm + H R on the right 
hand side acts on the space of states with only k states on each middle site. Hence, the results of Ref. [3] showing a 
spectral gap carry over to this case, and H' L + H' R + Hm has a doubly degenerate ground state with a spectral gap. 

Finally, we show that H' L + H' R + H' M has a unique ground state. Since we know that H' L + H R + Hm has a 
lower bound on the gap, it is lower bounded by a constant times the projector onto the ground state. Using the 
same lemma 4.1[ it suffices to consider the ground state subspace of H' L + H R + H R and show that the operator 
\AB)(AB\ + ]bA)(BA\ vanishes on one of the ground states, but not the other. Indeed, the unique ground state of 
H' l + H' r + H' m is highly entangled. 




23 


Appendix A: Feasible Solutions 

We consider for which dimensions d it is possible to have two d-dimensional vectors, v. w, with non-negative entries, 
with \v\? = \w \<2 = 1, and with given inner product (u|u>), such that 


M1H1 
>\? + W 


< (v\w). 


(Al) 


Let |1) denote the vector with all coefficients equal to 1/Vd. Write v as a linear combination v = cos(0)|l)-t-sin(0)u -L , 
for some angle 6 and some rA with (u _L |l) = 0 and |u _L | = 1. Similarly, write w as a linear combination w = 
cos(</)|l) -)- sin(</>)u;- L , where also \w ± \2 = 1 and (u;- L |l) = 0. Let z = (tr L |tu- L ). Then, we must satisfy 


> cos(d) cos(</) 

cos(d) 2 + cos (</>) 2 


< cos(d) cos{(/)) + sin(d) sin(</),z < cos(d) cos{4 >) + sin(d) sin(</), 


(A2) 


where without loss of generality we assume that sin(0) > 0, sin (cf>) > 0. 


Let 9 = p + 5 /2 and <j> = p — 8/2 for some angles p, 6 . Then, the left-hand side of Eq. (A2) is equal to 


(cos(p) cos(d/2) — sin(p) sin(d/2)) (cos(p) cos(<5/2) + sin(p) sin(<5/2) 

2—--/A-(A3) 

^cos(p) cos(d/2) — sin(p) sin(<5/2)^ + ^cos(p) cos(d/2) + sin(p) sin(d/2)^ 

cos(p) 2 cos(d/2) 2 — sin(p) 2 sin(<5/2) 2 
cos(p) 2 cos(d/2) 2 + sin(p) 2 sin(<5/2) 2 ' 

For any fixed 5, this is minimized by taking sin(p) 2 as large as possible. In this case, “as large as possible” encounters 
the constraint that for v,w having non-negative entries, we have cos(0) > l/y/d, cos(</>) > 1/y/d. Without loss of 
generality, assume 0 < </> < 9. So, we can assume that cos(d) = l/V d and so sin(d) = \J(d — 1 )/d. We have <f> = 9 —6. 
Hence, cos(</>) = cos(<5)/y / d + sin(d )\/(d — 1 ) /d. 

Taking this, the left-hand side of Eq. (|A2[) is equal to 


cos (6) + Vd — 1 sin(<5) 


= 2 - 


cos(d) + Vd — 1 sin(d) 


1 + cos(<5) 2 + (d — 1) sin(d) 2 + 2 \Jd— 1 cos(d) sin(d) 2 + (d — 2) sin(d) 2 + 2 \Jd— 1 cos(d) sin(d) ’ 


(A4) 


while the right-hand side is at most equal to cos(d). 

By a series analysis, one may verify that the inequality 

cos(<5) + \/d — 1 sin(d) 


2 + (d — 2) sin(d) 2 + 2 y/d— 1 cos(S) sin(d) 


< cos(d) 


(AS) 


has solutions with the left-hand side less than 1 for d > 2; this is done by considering the behavior for small S and 
showing that the left-hand side decreases more rapidly as a function of S to order S 2 . However, a numerical solution 
shows that for d = 3, the inequality can only be satisfied if the left-hand side is greater than 0.94.... An explicit pair 
of such vectors v, w can be written in the form v = (1, 0, 0), w = (Vl — 2e 2 , e, e) for appropriate e 

For larger d, however, the inequality can be satisfied at smaller values of the left-hand side. For fixed d, the left-hand 
side scales as 2/(\/dsin(d)) for large d. To make this as small as possible, one would like to take sin(d) large, thus the 
best asymptotics for the left-hand side we could hope to achieve is 2 /a/ d. In fact, we can achieve this asymptotics: pick 
sin(d) = \J\ — f(d) 2 and cos(d) = /(d) for any function /(d) which is asymptotically less than 1 but asymptotically 
greater than 2/y/d. 


Finally, we note that Eq. (A2) has no solution with the left-hand side less than 1 for d = 2. This can be seen by 


verifying that in this case the inequality (A5) becomes 

cos(d) + sin(d) 


1 + cos(d) sin(d) 


< cos(d), 


(A6) 


or equivalently, cos(d) + sin(d) < cos(d) + cos 2 (d) sin(d), which for sin(d), cos(d) > 0 can only be satisfied if cos(d) = 1. 
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